Location tests for biomarker studies: a comparison using simulations for the two-sample case.

نویسندگان

  • M O Scheinhardt
  • A Ziegler
چکیده

BACKGROUND Gene, protein, or metabolite expression levels are often non-normally distributed, heavy tailed and contain outliers. Standard statistical approaches may fail as location tests in this situation. OBJECTIVES In three Monte-Carlo simulation studies, we aimed at comparing the type I error levels and empirical power of standard location tests and three adaptive tests [O'Gorman, Can J Stat 1997; 25: 269 -279; Keselman et al., Brit J Math Stat Psychol 2007; 60: 267- 293; Szymczak et al., Stat Med 2013; 32: 524 - 537] for a wide range of distributions. METHODS We simulated two-sample scenarios using the g-and-k-distribution family to systematically vary tail length and skewness with identical and varying variability between groups. RESULTS All tests kept the type I error level when groups did not vary in their variability. The standard non-parametric U-test performed well in all simulated scenarios. It was outperformed by the two non-parametric adaptive methods in case of heavy tails or large skewness. Most tests did not keep the type I error level for skewed data in the case of heterogeneous variances. CONCLUSIONS The standard U-test was a powerful and robust location test for most of the simulated scenarios except for very heavy tailed or heavy skewed data, and it is thus to be recommended except for these cases. The non-parametric adaptive tests were powerful for both normal and non-normal distributions under sample variance homogeneity. But when sample variances differed, they did not keep the type I error level. The parametric adaptive test lacks power for skewed and heavy tailed distributions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Scenario-based modeling for multiple allocation hub location problem under disruption risk: multiple cuts Benders decomposition approach

The hub location problem arises in a variety of domains such as transportation and telecommunication systems. In many real-world situations, hub facilities are subject to disruption. This paper deals with the multiple allocation hub location problem in the presence of facilities failure. To model the problem, a two-stage stochastic formulation is developed. In the proposed model, the number of ...

متن کامل

Comparison of two integration schemes for a micropolar plasticity model

Micropolar plasticity provides the capability to carry out post-failure simulations of geo-structures due to microstructural considerations and embedded length scale in its formulation. An essential part of the numerical implementation of a micropolar plasticity model is the integration of the rate constitutive equations. Efficiency and robustness of the implementation hinge on the type of int...

متن کامل

Location of compressed natural gas stations using multi-objective flow refueling location model in the two-way highways: A case study in Iran

Increasing the use of fossil fuels is with severe environmental and economic problems, bringing more attention to alternative fuels. The compressed natural gas (CNG), as an alternative fuel, offers many more benefits than gasoline or diesel fuel such as cost-effectiveness, lower pollution, better performance, and lower maintenance costs. Gas stations location and the number of gas stations are ...

متن کامل

A two-stage GIS-based optimization model for the dry port location problem: A case study of Iran

This article aims to investigate the location of dry ports by providing a two-stage GIS-optimization model. At the first stage, the appropriate points for establishing dry ports were identified using GIS and hierarchical analysis process; then, the suitable points were introduced as the potential points to the second stage model. At the second stage, by providing a multi-objective integer model...

متن کامل

On the Canonical-Based Goodness-of-fit Tests for Multivariate Skew-Normality

It is well-known that the skew-normal distribution can provide an alternative model to the normal distribution for analyzing asymmetric data. The aim of this paper is to propose two goodness-of-fit tests for assessing whether a sample comes from a multivariate skew-normal (MSN) distribution. We address the problem of multivariate skew-normality goodness-of-fit based on the empirical Laplace tra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Methods of information in medicine

دوره 52 4  شماره 

صفحات  -

تاریخ انتشار 2013